Overview
Brought to you by YData
Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 5000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 390.8 KiB |
| Average record size in memory | 80.0 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 1 |
Air Quality is highly overall correlated with CO and 4 other fields | High correlation |
CO is highly overall correlated with Air Quality and 7 other fields | High correlation |
Humidity is highly overall correlated with CO | High correlation |
NO2 is highly overall correlated with Air Quality and 4 other fields | High correlation |
PM10 is highly overall correlated with CO and 2 other fields | High correlation |
PM2.5 is highly overall correlated with PM10 | High correlation |
Population_Density is highly overall correlated with CO and 1 other fields | High correlation |
Proximity_to_Industrial_Areas is highly overall correlated with Air Quality and 6 other fields | High correlation |
SO2 is highly overall correlated with Air Quality and 4 other fields | High correlation |
Temperature is highly overall correlated with Air Quality and 4 other fields | High correlation |
Reproduction
| Analysis started | 2025-01-02 08:17:31.312099 |
|---|---|
| Analysis finished | 2025-01-02 08:18:01.471814 |
| Duration | 30.16 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
Temperature
Real number (ℝ)
High correlation 
| Distinct | 362 |
|---|---|
| Distinct (%) | 7.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.02902 |
| Minimum | 13.4 |
|---|---|
| Maximum | 58.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 13.4 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 25.1 |
| median | 29 |
| Q3 | 34 |
| 95-th percentile | 42.6 |
| Maximum | 58.6 |
| Range | 45.2 |
| Interquartile range (IQR) | 8.9 |
Descriptive statistics
| Standard deviation | 6.7206614 |
|---|---|
| Coefficient of variation (CV) | 0.22380555 |
| Kurtosis | 0.51316206 |
| Mean | 30.02902 |
| Median Absolute Deviation (MAD) | 4.4 |
| Skewness | 0.75218681 |
| Sum | 150145.1 |
| Variance | 45.167289 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 26.8 | 45 | 0.9% |
| 26.7 | 43 | 0.9% |
| 29.4 | 42 | 0.8% |
| 26.3 | 41 | 0.8% |
| 27.4 | 41 | 0.8% |
| 23.6 | 41 | 0.8% |
| 24.6 | 38 | 0.8% |
| 32.2 | 38 | 0.8% |
| 27.8 | 37 | 0.7% |
| 26.2 | 36 | 0.7% |
| Other values (352) | 4598 |
| Value | Count | Frequency (%) |
| 13.4 | 1 | |
| 14.1 | 1 | |
| 14.4 | 1 | |
| 15.3 | 1 | |
| 15.4 | 1 | |
| 15.5 | 1 | |
| 16 | 1 | |
| 16.1 | 1 | |
| 16.4 | 2 | |
| 16.5 | 1 |
| Value | Count | Frequency (%) |
| 58.6 | 1 | |
| 57.8 | 1 | |
| 57.7 | 1 | |
| 57.2 | 1 | |
| 56.5 | 1 | |
| 56 | 2 | |
| 55.9 | 1 | |
| 55.7 | 1 | |
| 55 | 1 | |
| 54.7 | 1 |
Humidity
Real number (ℝ)
High correlation 
| Distinct | 723 |
|---|---|
| Distinct (%) | 14.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 70.05612 |
| Minimum | 36 |
|---|---|
| Maximum | 128.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 36 |
|---|---|
| 5-th percentile | 45.1 |
| Q1 | 58.3 |
| median | 69.8 |
| Q3 | 80.3 |
| 95-th percentile | 97.905 |
| Maximum | 128.1 |
| Range | 92.1 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 15.863577 |
|---|---|
| Coefficient of variation (CV) | 0.22644098 |
| Kurtosis | -0.29031593 |
| Mean | 70.05612 |
| Median Absolute Deviation (MAD) | 10.9 |
| Skewness | 0.28052793 |
| Sum | 350280.6 |
| Variance | 251.65307 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 73 | 20 | 0.4% |
| 72.5 | 18 | 0.4% |
| 67.6 | 18 | 0.4% |
| 64.4 | 18 | 0.4% |
| 64.6 | 18 | 0.4% |
| 60.1 | 18 | 0.4% |
| 67.9 | 17 | 0.3% |
| 72.7 | 17 | 0.3% |
| 65.8 | 17 | 0.3% |
| 75.6 | 17 | 0.3% |
| Other values (713) | 4822 |
| Value | Count | Frequency (%) |
| 36 | 1 | < 0.1% |
| 36.1 | 1 | < 0.1% |
| 36.3 | 1 | < 0.1% |
| 36.9 | 1 | < 0.1% |
| 38.2 | 2 | |
| 38.3 | 3 | |
| 38.4 | 1 | < 0.1% |
| 38.5 | 1 | < 0.1% |
| 38.6 | 2 | |
| 38.7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 128.1 | 1 | |
| 124.7 | 1 | |
| 123 | 1 | |
| 122.3 | 1 | |
| 120.7 | 1 | |
| 120.5 | 2 | |
| 119.4 | 1 | |
| 117.3 | 1 | |
| 116.9 | 2 | |
| 116.3 | 1 |
PM2.5
Real number (ℝ)
High correlation 
| Distinct | 815 |
|---|---|
| Distinct (%) | 16.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.14214 |
| Minimum | 0 |
|---|---|
| Maximum | 295 |
| Zeros | 20 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.8 |
| Q1 | 4.6 |
| median | 12 |
| Q3 | 26.1 |
| 95-th percentile | 68.4 |
| Maximum | 295 |
| Range | 295 |
| Interquartile range (IQR) | 21.5 |
Descriptive statistics
| Standard deviation | 24.554546 |
|---|---|
| Coefficient of variation (CV) | 1.2190634 |
| Kurtosis | 13.033781 |
| Mean | 20.14214 |
| Median Absolute Deviation (MAD) | 8.8 |
| Skewness | 2.89091 |
| Sum | 100710.7 |
| Variance | 602.92572 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.5 | 38 | 0.8% |
| 1.1 | 37 | 0.7% |
| 2 | 36 | 0.7% |
| 0.7 | 35 | 0.7% |
| 0.4 | 33 | 0.7% |
| 2.3 | 32 | 0.6% |
| 0.3 | 32 | 0.6% |
| 2.8 | 32 | 0.6% |
| 1 | 31 | 0.6% |
| 2.5 | 31 | 0.6% |
| Other values (805) | 4663 |
| Value | Count | Frequency (%) |
| 0 | 20 | |
| 0.1 | 30 | |
| 0.2 | 26 | |
| 0.3 | 32 | |
| 0.4 | 33 | |
| 0.5 | 24 | |
| 0.6 | 27 | |
| 0.7 | 35 | |
| 0.8 | 29 | |
| 0.9 | 25 |
| Value | Count | Frequency (%) |
| 295 | 1 | |
| 240.1 | 1 | |
| 216.9 | 1 | |
| 204 | 1 | |
| 193.1 | 1 | |
| 186.7 | 1 | |
| 173.9 | 1 | |
| 173.2 | 1 | |
| 169.2 | 1 | |
| 168.6 | 2 |
PM10
Real number (ℝ)
High correlation 
| Distinct | 955 |
|---|---|
| Distinct (%) | 19.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.21836 |
| Minimum | -0.2 |
|---|---|
| Maximum | 315.8 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | -0.2 |
|---|---|
| 5-th percentile | 5.8 |
| Q1 | 12.3 |
| median | 21.7 |
| Q3 | 38.1 |
| 95-th percentile | 84.705 |
| Maximum | 315.8 |
| Range | 316 |
| Interquartile range (IQR) | 25.8 |
Descriptive statistics
| Standard deviation | 27.349199 |
|---|---|
| Coefficient of variation (CV) | 0.9050524 |
| Kurtosis | 10.273039 |
| Mean | 30.21836 |
| Median Absolute Deviation (MAD) | 11.2 |
| Skewness | 2.5348148 |
| Sum | 151091.8 |
| Variance | 747.97871 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8.1 | 28 | 0.6% |
| 16.3 | 26 | 0.5% |
| 10.9 | 25 | 0.5% |
| 14.1 | 24 | 0.5% |
| 18.9 | 24 | 0.5% |
| 8.4 | 24 | 0.5% |
| 8.8 | 23 | 0.5% |
| 8 | 22 | 0.4% |
| 15.5 | 22 | 0.4% |
| 14.9 | 22 | 0.4% |
| Other values (945) | 4760 |
| Value | Count | Frequency (%) |
| -0.2 | 1 | < 0.1% |
| 0 | 1 | < 0.1% |
| 0.1 | 1 | < 0.1% |
| 0.2 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 1.3 | 1 | < 0.1% |
| 1.5 | 2 | |
| 1.7 | 1 | < 0.1% |
| 1.9 | 4 | |
| 2 | 2 |
| Value | Count | Frequency (%) |
| 315.8 | 1 | |
| 261.5 | 1 | |
| 240 | 1 | |
| 221.6 | 1 | |
| 212.6 | 1 | |
| 209.8 | 1 | |
| 194.7 | 1 | |
| 190.7 | 1 | |
| 190 | 1 | |
| 188.2 | 1 |
NO2
Real number (ℝ)
High correlation 
| Distinct | 445 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.4121 |
| Minimum | 7.4 |
|---|---|
| Maximum | 64.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 7.4 |
|---|---|
| 5-th percentile | 13.595 |
| Q1 | 20.1 |
| median | 25.3 |
| Q3 | 31.9 |
| 95-th percentile | 43.1 |
| Maximum | 64.9 |
| Range | 57.5 |
| Interquartile range (IQR) | 11.8 |
Descriptive statistics
| Standard deviation | 8.8953564 |
|---|---|
| Coefficient of variation (CV) | 0.33679095 |
| Kurtosis | 0.24846135 |
| Mean | 26.4121 |
| Median Absolute Deviation (MAD) | 5.8 |
| Skewness | 0.63878267 |
| Sum | 132060.5 |
| Variance | 79.127365 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24.2 | 38 | 0.8% |
| 25.3 | 34 | 0.7% |
| 26.6 | 33 | 0.7% |
| 23.1 | 31 | 0.6% |
| 23 | 31 | 0.6% |
| 23.5 | 31 | 0.6% |
| 25.4 | 30 | 0.6% |
| 20.9 | 30 | 0.6% |
| 23.4 | 29 | 0.6% |
| 22.9 | 29 | 0.6% |
| Other values (435) | 4684 |
| Value | Count | Frequency (%) |
| 7.4 | 1 | < 0.1% |
| 8.5 | 1 | < 0.1% |
| 9.1 | 1 | < 0.1% |
| 9.2 | 1 | < 0.1% |
| 9.3 | 1 | < 0.1% |
| 9.9 | 2 | |
| 10 | 3 | |
| 10.1 | 2 | |
| 10.2 | 2 | |
| 10.3 | 2 |
| Value | Count | Frequency (%) |
| 64.9 | 1 | |
| 62.1 | 1 | |
| 59.3 | 2 | |
| 57.3 | 1 | |
| 56.8 | 1 | |
| 56.6 | 1 | |
| 56.4 | 1 | |
| 56.1 | 1 | |
| 56 | 1 | |
| 55.8 | 1 |
SO2
Real number (ℝ)
High correlation 
| Distinct | 348 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.01482 |
| Minimum | -6.2 |
|---|---|
| Maximum | 44.9 |
| Zeros | 5 |
| Zeros (%) | 0.1% |
| Negative | 30 |
| Negative (%) | 0.6% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | -6.2 |
|---|---|
| 5-th percentile | 2.3 |
| Q1 | 5.1 |
| median | 8 |
| Q3 | 13.725 |
| 95-th percentile | 23.6 |
| Maximum | 44.9 |
| Range | 51.1 |
| Interquartile range (IQR) | 8.625 |
Descriptive statistics
| Standard deviation | 6.7503034 |
|---|---|
| Coefficient of variation (CV) | 0.67403142 |
| Kurtosis | 1.3292471 |
| Mean | 10.01482 |
| Median Absolute Deviation (MAD) | 3.7 |
| Skewness | 1.1667723 |
| Sum | 50074.1 |
| Variance | 45.566596 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.7 | 61 | 1.2% |
| 5.9 | 57 | 1.1% |
| 4.5 | 55 | 1.1% |
| 4.9 | 54 | 1.1% |
| 5.3 | 53 | 1.1% |
| 6.3 | 53 | 1.1% |
| 5 | 52 | 1.0% |
| 6.4 | 49 | 1.0% |
| 4.6 | 48 | 1.0% |
| 5.1 | 48 | 1.0% |
| Other values (338) | 4470 |
| Value | Count | Frequency (%) |
| -6.2 | 1 | |
| -4.1 | 1 | |
| -3.4 | 1 | |
| -2.8 | 1 | |
| -1.9 | 1 | |
| -1.7 | 1 | |
| -1.4 | 1 | |
| -1.2 | 1 | |
| -0.6 | 2 | |
| -0.5 | 2 |
| Value | Count | Frequency (%) |
| 44.9 | 1 | |
| 42.3 | 1 | |
| 40.7 | 1 | |
| 40.5 | 1 | |
| 39.6 | 1 | |
| 38.7 | 1 | |
| 37.6 | 2 | |
| 36.8 | 2 | |
| 36.5 | 1 | |
| 36.2 | 1 |
CO
Real number (ℝ)
High correlation 
| Distinct | 265 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.500354 |
| Minimum | 0.65 |
|---|---|
| Maximum | 3.72 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0.65 |
|---|---|
| 5-th percentile | 0.88 |
| Q1 | 1.03 |
| median | 1.41 |
| Q3 | 1.84 |
| 95-th percentile | 2.53 |
| Maximum | 3.72 |
| Range | 3.07 |
| Interquartile range (IQR) | 0.81 |
Descriptive statistics
| Standard deviation | 0.54602667 |
|---|---|
| Coefficient of variation (CV) | 0.36393189 |
| Kurtosis | 0.20965338 |
| Mean | 1.500354 |
| Median Absolute Deviation (MAD) | 0.39 |
| Skewness | 0.8790677 |
| Sum | 7501.77 |
| Variance | 0.29814512 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.98 | 88 | 1.8% |
| 0.99 | 88 | 1.8% |
| 1.02 | 87 | 1.7% |
| 1.03 | 85 | 1.7% |
| 1.01 | 84 | 1.7% |
| 1.04 | 80 | 1.6% |
| 0.94 | 73 | 1.5% |
| 0.97 | 73 | 1.5% |
| 0.92 | 70 | 1.4% |
| 1 | 70 | 1.4% |
| Other values (255) | 4202 |
| Value | Count | Frequency (%) |
| 0.65 | 1 | < 0.1% |
| 0.68 | 1 | < 0.1% |
| 0.69 | 1 | < 0.1% |
| 0.72 | 4 | 0.1% |
| 0.73 | 4 | 0.1% |
| 0.74 | 4 | 0.1% |
| 0.75 | 2 | < 0.1% |
| 0.76 | 9 | |
| 0.77 | 9 | |
| 0.78 | 10 |
| Value | Count | Frequency (%) |
| 3.72 | 1 | < 0.1% |
| 3.67 | 1 | < 0.1% |
| 3.65 | 1 | < 0.1% |
| 3.61 | 1 | < 0.1% |
| 3.54 | 1 | < 0.1% |
| 3.48 | 2 | |
| 3.4 | 1 | < 0.1% |
| 3.37 | 2 | |
| 3.36 | 3 | |
| 3.35 | 2 |
Proximity_to_Industrial_Areas
Real number (ℝ)
High correlation 
| Distinct | 179 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.4254 |
| Minimum | 2.5 |
|---|---|
| Maximum | 25.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 2.5 |
|---|---|
| 5-th percentile | 3.5 |
| Q1 | 5.4 |
| median | 7.9 |
| Q3 | 11.1 |
| 95-th percentile | 14.4 |
| Maximum | 25.8 |
| Range | 23.3 |
| Interquartile range (IQR) | 5.7 |
Descriptive statistics
| Standard deviation | 3.6109437 |
|---|---|
| Coefficient of variation (CV) | 0.42857831 |
| Kurtosis | -0.23374824 |
| Mean | 8.4254 |
| Median Absolute Deviation (MAD) | 2.8 |
| Skewness | 0.46975156 |
| Sum | 42127 |
| Variance | 13.038915 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.1 | 112 | 2.2% |
| 10.2 | 112 | 2.2% |
| 10.3 | 107 | 2.1% |
| 10.1 | 105 | 2.1% |
| 5.2 | 99 | 2.0% |
| 5.4 | 97 | 1.9% |
| 10.4 | 96 | 1.9% |
| 5.6 | 90 | 1.8% |
| 10.5 | 88 | 1.8% |
| 11.1 | 86 | 1.7% |
| Other values (169) | 4008 |
| Value | Count | Frequency (%) |
| 2.5 | 14 | 0.3% |
| 2.6 | 28 | |
| 2.7 | 24 | |
| 2.8 | 26 | |
| 2.9 | 16 | 0.3% |
| 3 | 21 | 0.4% |
| 3.1 | 12 | 0.2% |
| 3.2 | 19 | 0.4% |
| 3.3 | 23 | 0.5% |
| 3.4 | 58 |
| Value | Count | Frequency (%) |
| 25.8 | 1 | < 0.1% |
| 25.2 | 1 | < 0.1% |
| 24.8 | 1 | < 0.1% |
| 23.4 | 1 | < 0.1% |
| 21.8 | 1 | < 0.1% |
| 21.7 | 1 | < 0.1% |
| 21.6 | 1 | < 0.1% |
| 21.5 | 1 | < 0.1% |
| 20.8 | 1 | < 0.1% |
| 20 | 3 |
Population_Density
Real number (ℝ)
High correlation 
| Distinct | 683 |
|---|---|
| Distinct (%) | 13.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 497.4238 |
| Minimum | 188 |
|---|---|
| Maximum | 957 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 188 |
|---|---|
| 5-th percentile | 249 |
| Q1 | 381 |
| median | 494 |
| Q3 | 600 |
| 95-th percentile | 765 |
| Maximum | 957 |
| Range | 769 |
| Interquartile range (IQR) | 219 |
Descriptive statistics
| Standard deviation | 152.75408 |
|---|---|
| Coefficient of variation (CV) | 0.30709042 |
| Kurtosis | -0.47380952 |
| Mean | 497.4238 |
| Median Absolute Deviation (MAD) | 109.5 |
| Skewness | 0.20423108 |
| Sum | 2487119 |
| Variance | 23333.81 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 494 | 24 | 0.5% |
| 454 | 23 | 0.5% |
| 543 | 22 | 0.4% |
| 471 | 20 | 0.4% |
| 501 | 20 | 0.4% |
| 538 | 20 | 0.4% |
| 511 | 19 | 0.4% |
| 438 | 19 | 0.4% |
| 506 | 19 | 0.4% |
| 485 | 19 | 0.4% |
| Other values (673) | 4795 |
| Value | Count | Frequency (%) |
| 188 | 1 | < 0.1% |
| 189 | 3 | |
| 191 | 1 | < 0.1% |
| 193 | 2 | |
| 194 | 2 | |
| 196 | 3 | |
| 197 | 1 | < 0.1% |
| 198 | 2 | |
| 199 | 3 | |
| 200 | 4 |
| Value | Count | Frequency (%) |
| 957 | 1 | |
| 951 | 1 | |
| 939 | 1 | |
| 937 | 1 | |
| 934 | 2 | |
| 933 | 1 | |
| 927 | 2 | |
| 924 | 1 | |
| 923 | 1 | |
| 914 | 1 |
Air Quality
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| Good | |
|---|---|
| Moderate | |
| Poor | |
| Hazardous |
Length
| Max length | 9 |
|---|---|
| Median length | 4 |
| Mean length | 5.7 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Moderate |
|---|---|
| 2nd row | Moderate |
| 3rd row | Moderate |
| 4th row | Good |
| 5th row | Good |
Common Values
| Value | Count | Frequency (%) |
| Good | 2000 | |
| Moderate | 1500 | |
| Poor | 1000 | |
| Hazardous | 500 | 10.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| good | 2000 | |
| moderate | 1500 | |
| poor | 1000 | |
| hazardous | 500 | 10.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 8000 | |
| d | 4000 | |
| e | 3000 | 10.5% |
| r | 3000 | 10.5% |
| a | 2500 | 8.8% |
| G | 2000 | 7.0% |
| M | 1500 | 5.3% |
| t | 1500 | 5.3% |
| P | 1000 | 3.5% |
| H | 500 | 1.8% |
| Other values (3) | 1500 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 28500 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 8000 | |
| d | 4000 | |
| e | 3000 | 10.5% |
| r | 3000 | 10.5% |
| a | 2500 | 8.8% |
| G | 2000 | 7.0% |
| M | 1500 | 5.3% |
| t | 1500 | 5.3% |
| P | 1000 | 3.5% |
| H | 500 | 1.8% |
| Other values (3) | 1500 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 28500 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 8000 | |
| d | 4000 | |
| e | 3000 | 10.5% |
| r | 3000 | 10.5% |
| a | 2500 | 8.8% |
| G | 2000 | 7.0% |
| M | 1500 | 5.3% |
| t | 1500 | 5.3% |
| P | 1000 | 3.5% |
| H | 500 | 1.8% |
| Other values (3) | 1500 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 28500 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 8000 | |
| d | 4000 | |
| e | 3000 | 10.5% |
| r | 3000 | 10.5% |
| a | 2500 | 8.8% |
| G | 2000 | 7.0% |
| M | 1500 | 5.3% |
| t | 1500 | 5.3% |
| P | 1000 | 3.5% |
| H | 500 | 1.8% |
| Other values (3) | 1500 | 5.3% |
Interactions
Correlations
| Air Quality | CO | Humidity | NO2 | PM10 | PM2.5 | Population_Density | Proximity_to_Industrial_Areas | SO2 | Temperature | |
|---|---|---|---|---|---|---|---|---|---|---|
| Air Quality | 1.000 | 0.751 | 0.411 | 0.541 | 0.321 | 0.240 | 0.437 | 0.636 | 0.528 | 0.520 |
| CO | 0.751 | 1.000 | 0.551 | 0.709 | 0.585 | 0.379 | 0.573 | -0.772 | 0.688 | 0.689 |
| Humidity | 0.411 | 0.551 | 1.000 | 0.475 | 0.385 | 0.255 | 0.388 | -0.487 | 0.443 | 0.452 |
| NO2 | 0.541 | 0.709 | 0.475 | 1.000 | 0.485 | 0.314 | 0.486 | -0.645 | 0.573 | 0.583 |
| PM10 | 0.321 | 0.585 | 0.385 | 0.485 | 1.000 | 0.915 | 0.389 | -0.530 | 0.473 | 0.476 |
| PM2.5 | 0.240 | 0.379 | 0.255 | 0.314 | 0.915 | 1.000 | 0.250 | -0.337 | 0.306 | 0.306 |
| Population_Density | 0.437 | 0.573 | 0.388 | 0.486 | 0.389 | 0.250 | 1.000 | -0.506 | 0.455 | 0.460 |
| Proximity_to_Industrial_Areas | 0.636 | -0.772 | -0.487 | -0.645 | -0.530 | -0.337 | -0.506 | 1.000 | -0.627 | -0.628 |
| SO2 | 0.528 | 0.688 | 0.443 | 0.573 | 0.473 | 0.306 | 0.455 | -0.627 | 1.000 | 0.568 |
| Temperature | 0.520 | 0.689 | 0.452 | 0.583 | 0.476 | 0.306 | 0.460 | -0.628 | 0.568 | 1.000 |
Missing values
Sample
| Temperature | Humidity | PM2.5 | PM10 | NO2 | SO2 | CO | Proximity_to_Industrial_Areas | Population_Density | Air Quality | |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 29.8 | 59.1 | 5.2 | 17.9 | 18.9 | 9.2 | 1.72 | 6.3 | 319 | Moderate |
| 1 | 28.3 | 75.6 | 2.3 | 12.2 | 30.8 | 9.7 | 1.64 | 6.0 | 611 | Moderate |
| 2 | 23.1 | 74.7 | 26.7 | 33.8 | 24.4 | 12.6 | 1.63 | 5.2 | 619 | Moderate |
| 3 | 27.1 | 39.1 | 6.1 | 6.3 | 13.5 | 5.3 | 1.15 | 11.1 | 551 | Good |
| 4 | 26.5 | 70.7 | 6.9 | 16.0 | 21.9 | 5.6 | 1.01 | 12.7 | 303 | Good |
| 5 | 39.4 | 96.6 | 14.6 | 35.5 | 42.9 | 17.9 | 1.82 | 3.1 | 674 | Hazardous |
| 6 | 41.7 | 82.5 | 1.7 | 15.8 | 31.1 | 12.7 | 1.80 | 4.6 | 735 | Poor |
| 7 | 31.0 | 59.6 | 5.0 | 16.8 | 24.2 | 13.6 | 1.38 | 6.3 | 443 | Moderate |
| 8 | 29.4 | 93.8 | 10.3 | 22.7 | 45.1 | 11.8 | 2.03 | 5.4 | 486 | Poor |
| 9 | 33.2 | 80.5 | 11.1 | 24.4 | 32.0 | 15.3 | 1.69 | 4.9 | 535 | Poor |
| Temperature | Humidity | PM2.5 | PM10 | NO2 | SO2 | CO | Proximity_to_Industrial_Areas | Population_Density | Air Quality | |
|---|---|---|---|---|---|---|---|---|---|---|
| 4990 | 46.8 | 93.8 | 11.8 | 25.4 | 33.8 | 28.7 | 3.27 | 3.7 | 589 | Hazardous |
| 4991 | 31.8 | 80.2 | 22.4 | 34.1 | 29.7 | 4.9 | 1.22 | 9.4 | 580 | Moderate |
| 4992 | 29.8 | 56.7 | 6.8 | 14.0 | 23.0 | 4.5 | 1.10 | 11.4 | 567 | Good |
| 4993 | 34.9 | 77.7 | 32.3 | 47.1 | 17.4 | 11.5 | 1.63 | 8.8 | 541 | Moderate |
| 4994 | 31.1 | 61.0 | 27.1 | 31.1 | 13.0 | 3.8 | 0.98 | 13.4 | 278 | Good |
| 4995 | 40.6 | 74.1 | 116.0 | 126.7 | 45.5 | 25.7 | 2.11 | 2.8 | 765 | Hazardous |
| 4996 | 28.1 | 96.9 | 6.9 | 25.0 | 25.3 | 10.8 | 1.54 | 5.7 | 709 | Moderate |
| 4997 | 25.9 | 78.2 | 14.2 | 22.1 | 34.8 | 7.8 | 1.63 | 9.6 | 379 | Moderate |
| 4998 | 25.3 | 44.4 | 21.4 | 29.0 | 23.7 | 5.7 | 0.89 | 11.6 | 241 | Good |
| 4999 | 24.1 | 77.9 | 81.7 | 94.3 | 23.2 | 10.5 | 1.38 | 8.3 | 461 | Moderate |